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The self feaming modules on basic staustics are presented as two volumes. 


the first volume and consists of two lessons. Each lesson contains the jearnine 





re objectives, subject material, worked examples, assignments, and 1s 
‘ fearning. In addition, learning is to be supplernented with fixed learning modules 
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expected that students wiil undertake further reading for better underst 
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INTRODUCTION ; 


Statistics 
rhere are two different meanings to the term “statistics 


i) It is used to indicate facts and figures of any kind: health statistics, busin 
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statistics, vital statistics etc. 
ii) It is also used to refer to a body of knowledge known as statistical methods 


developed for handling numerical data in general i.e. collecting, organizing 


summarising and interpreting data. 


Biostatistics 
Statistics arising out of biological sciences, particularly from the ficla of medicine 


Staurstical methods when applied to biological sciences and medicine is referred to us 
Biostatistics. 

Statistics in ecneral consists of (Wo subdivisions: nhl UG statistics and oredichive or 
inferential statistics. | 

Descriptive statistics includes techniques of describing data in abbreviated. symbolic 
fashion. This could be done by: presenting the data as a diagram or lable, oF by using a 
number to represent the data such as an average or mean, standard deviation. e! 
Inferential statistics, makes use of a limited number of observations from a valid sample 


(0 predict or infer characteristics of an entire group or population, 


Nlcasurement : ; 


is essentially the assigning Of numbers to observations accor ding to 


A measurement 


™ Certain rules. All measurements are made according to one of four prinzinal svales, 


Scales of measurement 
The four principal scales of measurement are: 


1. Nominal scale 


fy 


Ordinal seale 


Interval scale 


ae 


Ratio scale 
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SYMBOLS USED IN THE BOOKLET 


sii" take note of 

fe" exercises 
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‘ow, refer to Fixed Leaming Module ( FILM ) 
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ti are sorted | i 
minal scale (classificatory scale) = Observations Ne a Broun. ild 
No na P d . 4 
is then counted e€.g. students in YOur h. 
categories, The number in each category bate), ee 
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ig moore four categories according to their blood groupings: A, B. AB a. 
tegory could then be counted. 
number of students in each category : 


Ordinal scale (ranking scale) - Arranges observations into RTOUPS Which on in 
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ordered senes. Information 
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ordinal da‘a, but information as to how much greater than or less than is NOt indice. 
eens cee a. eon aS 10 DOM much greate Bes res owt nprtpece sg 
Household: C, 


€.2. socio-economic status : lower class, middle class and upper class. 








community could be ranked according, to their socio-economic status as low ine, 


middle income and high income categories. . : Bie 
* ‘ P ‘ ‘ . x : 2 ‘ Seek es : \W 
Interval scale - This is a scale in which the intervals between SUCCESSIVE pointe 
‘ v . ‘ * ‘ ‘ nnd 
equal. Measurements made using this scale contains information With regard ty 
much greater than or how much less than a measuremen: IS In Comparison WIth ang 
~~ © &. Measiifethents made'in Fahrenhe; (or Celsius) temperature seale - 


. . . . * } 
Ratio scale - In this scale loo, the intervals between successive POINLS are equal, at h 


scale has an added advantage of having an absolute zero (aS Opposed to an arbitrary a 


~~ 


i 0 > >) : : 
such as ')” Fahrenheit or Celsius temperature), which enables measurements io 
compared, c.g. a person ¢ feet tall could be Slated as being twice as ll as same. 


Who is 3 jcet tall 


Variable - A variable is anything that can be Observed ty Vary, 


Variables copie he CMevOrised as continuous and discrete » 


‘| =. shi ss * ‘ 
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A conti ‘ari : l 
NUOUS Variable can lake any value NM a range yf Values. Ileights in a sample 


medical ri : 
tudents if found (0 range from 4.8m to 172.7em, means (hal a particu 


Student's height can lake any value Within this range 


It can either take a fractional va. 
(172.724 sm, (72. 72cm, 172.7cm), ies | 


AN integer or Whole number (l73em) depending © 
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A discrete variable can Only take a defi 
the variable is an integer e.g, parity : 
900, 525, 558 ete. This will not take a value such as $58.5 


Vanables could also be classified accordin 
and qualitative. 


Quantitative variables are variables m 
See eesesiesneteees “ 


e.g. height and weight. Q 


some quality such as, Sex, religion, ethnicity or nationality. 
ee ee 


nite value in any given interval. Here the value of 


1, 2, 3. Number of households in a community :- 


g to the scale of measurement as quantitative 


ecasured using either the interval or ratio scale. 


ualitative variables are variables classified on the basis of 
—_———— — 


This type of variable can be 
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measured using only the nominal and ordinal scales. The measurement made refers to 


the number (frequency or count) in each calegory. 
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[- DESCRIPTIVE STATISTICS 


LESSON I 


Non numerical methods of describing data 


LEARNING OBJECTIVES 
At the end of the lesson the student should be able to: 


|. Organize the data sel in an array and describe it using the 
minimum value, maximum value and range 


Comment on the uses and limitations of such re-arranged data 


tJ 


3. Summarize the data into a frequency distribution, both 
ungrouped and grouped 

Comment on the uses and limitations of summarized data 
Present data by way of tabulations and diagrams 


Describe data presented in a tabular or diagrammatc fonn 


a A  S 


Comment on the advantages and disadvantages of such 


presentations 


1. Hereisa small dita set 


ee emer 


Figure 1.1 


Number of deaths which occurred per day over a 
period of one month in hospital A. 










Mon Tuc Wed Thu Fri Sat Sun 
cnc tall 


Week 1 l 4 0 J 2 | 
Week 2 3 3 0. 2c | 3 
Week 3 2 oe | Gos 3 aw 
Week 4 wee Tee 
Week 5 a : 






B , * ‘ ' : q ’ ° r | | | 
y examining the raw data, what comments can you make ? 


re The raw cata needs to be first organized in 
eds rst organized in a meaningtul way so that they could 
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be easily understood. The ver ic re- 
J ee ene thee very basic re-arrangement of daia is called en ARRAY 


— 1.1 Steps in making an array 
1. Find minimum and maximum values 


2. Arraage the values either in an increasing or decreasing order of Magni 
I Uda 









Figure 1.2 
minimum maximum 
jo | 
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| Exercise 1.1 | 


1. Waat would you deduce from the re-arrangement of data? 


RS een Aa AOO ROR DOR GO BHT T 62 FOC EWA ED OO O58 FURIES ASEAN ES I re eee ee tae 
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Well !! you couid suzy littl ..........ee ee cree 
® minimum number of deaths per day ts 0. 
e maxinium number of deaths per day ts 5. 


e dvaths per day range from 0 to 5 (range = 5) 





2. Try to describe in words the data presented in table 1. | 
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Summarizing data in this manner becomes difficult when the range in the daty 
set is Wide. In such situations the range has tobe divided inte groups or 
intervals, 


3, Define a class interval 
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eee os Eee ee es ere ee ose me ee oe oe a or a ,eavree ieee whee Bs 2s 4 yyvetee Bee 


(Check your definition with what is piven in this book ) 





| Eve NeErcise ha 3 
|, Let us look at the data set given in tie. 14. 


Does iediffer from the earlier data eet? 


te Peeteéete ew @€ & ee =e 


i a ae ce ae 


IS) oy ie ot ims i a ee ek a ee oe a oe oe oe 
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5 Heighis(em) of a sample at 
Students 1 your baiwi 


DOES) BOT ERS OE EE 9 wD ere wr ee PSR ry INET) PIPES SET) 


160.0 
146,7 
160.0 
154.9 
168.9 
152.4 
161.9 


Ih Pe 


ia7.5 


1574 
ott) 
156.2 
147.3 
152.4 
160.0 
154.9 
157.5 
15,7 


15d. 
156.2 
beast 
154.9 
158.8 
14444 
172. | 
159.4 
rh 





bod. 
bd2,7 
bg 
154.9 
155.6 
160.0 
io 
1594 
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1.3 Steps in summarizing data into a grouped frequency distribution 


!. Find the range in your data set. | 
| mate ARs ma ATS Eicon, tangents. 
2. Divide the range into segments called “ classes " or“ clasa intervala ” 
Oe As there are only 35 observations in your data set you don't need to 


have too many classes. 


Write your class intervals here. 


SVAN ORC OES OECD Od & Oia 2 CORK EKO Ce BOL’ elbaboe bee 
cl i a eo eee! OES We EEN: OY 8 Ob KE ies Mbt Eo ¢ Cie 0 oa ie ck Kae ve bab ke 
Ec ee ere DONC SREN ONE EME ee ENS ch hb ee bes cod oodc de hihnk bak cdic ek 


Ve Te tee eRe OEE ES OVER ECVE ST Og ONE OBOE bee eRe bMD OoLh O buwica cs 


aCe Cee een Oe Aue Ee keV Us ee'e hs hed 064 60k Coo. 6 bo 0 cok, Br. 
ar PON SORTA Ae ME ROR OWE ES CUN Sor y ne hbn bes 60k 0656 ho dbc prekc, Bawiid. 


tne col aay Ss Re aaa ia 
SPOT eceeueces L te # i / F ‘y 


Sree Re CP eee ORE Ae 6D es ok oe iw bd dae cca 


ee ep 4 @ 
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number of classes could vary between 5 - 12 depending on the size 
of the data set 
* minimum number of classes should be § 
¢ class limits in class intervals should not overlap 
* classes are generally of equal size 
Purpose of summarizing data is lost there are many class intervals 
While, too few may make you lose important Information about the data 
4. Divide the range by the number of classes you choose to have and the u ith of a 
class can be deleniniind. ees 


. List the class intervals in order and write them in column | of figure 5.) 


. Take each observation from your data set and allocate it ty the respective chess 


it belongs to using a tally mark and {il} column 2 of figure 5. |. 


~ 


Make sure that you have included al! values. Check whether hoth “mum.” 


(minimum and Maximum) have been included into the first and the last class 


respectively. 





nail By re-arranging the raw data you are now able to describe the data using 


2. What are the uses and li:nitations of re-arranging data into an array ” 


minimum'‘and maximum values and the range. 
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ES Ces CSV CSS SH LSS ESCH SHS EHS TOOHREEECSAHREKEHKR HOH ESC LSA HS TOSHHEOCEKRKEC CH KEK SSEHO Rec esaesstdhebcserveecedetds BrP See ES 
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Yes ! the data could be organized into a frequency distribution. 


1.2. Steps in organizing data into a frequency distribution, 





Arrange all possibl: values this variable could take (number of deaths per day) in 
an ascending order of marnitude 

If you examine column | in figure 1.3 you would realise that this 

column includes all possible values that fall within the ranee 

Allocate cach observation in the data set using a tally mark, to the 

respective category of the variable they belong to as shown in column 2 


Count the tally marss in euch category to obtain the frequeney in column 3 


Dame det, Bie I 


-Ommun; 
. \% p 


animes 
column | column 2 column 3 
Figure 1.3 
Number of Tally mark Frequency 
Deaths/day (x) (f) 
0 HH 5 
| IK I 7 
y) TAL It 8 
3 IHL I 7 
4 // 2 
5 / ] 


(37 The data has now being organized into a Frequency Distribution. This proy, 


a summary of the raw data. This can be presented in a [frequency distriby 


table as in table |.1 








Table 1.1 


Distribution of number of deaths that occurred per day 


during a period of onc month in hospital A : 
ap aE Tee TS 


Number of deaths / day Frequency (percent) 
fo Se ne ‘ 
: 5 (16.7) . 
7 (23.3) | 
§ (26.7) 
7 (23.3) | 
2 (6.7) : 


f2. 2 
| (2.3) 


30 ( 100) 





| Exerciie 1.2 | 


|. What are the uses and limitations of summarising data © 
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6, Count the tally marks in each class interval and write the class frequencies in 
_ column 3, ; 
EF in grouped frequency distribution the categories of the variable are 
7 in groups as opposcd to what we saw in table 1.1. 
< column | column 2 column 3 
Figure 1.5 ° 
Frequency 
Eee ae iy . ; 
Phite'~ oe Pe - . A : 
BY aa r¢ey fst - 
daz ite : es sj Peel . My 
7. Present the heights you summarized above tn the form of a table. 
. Table 1.2 
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8. List the advantage of grouping data in a frequency distribution? 
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1.4 A table should have the following characteristics: 


= 


1. A title, clear and to the point. 


2. Anumber. 
3. Headings for cach row and each column. 
4. Units of measurement spccified cither in the title or row © column 


heading, as appropriate. 


5. Totals for both the frequencies and percentage frequencies. 


A table has to be self-explanatory 


Check the table 1.2 for the above features. 


1.5 You have now learnt to organize and condense data. 


The data has now: 


become concise without loosing the details 

brought out its pattern of variation 

become simple and meaningful cnough to form impressions 
become helpful in further analysis 


been presented in a form that needs few words to explain 


Data can be presented in many other ways that arouses the interest of the reader. 


1.6 Niethods of data presentation 
You have already presented the dala on the number of deaths in a hospital and heights o! 


students in your batch in the form of tables. 


The same data could also be presented by a diagram 





1.6.1 Tabulation ee 
ith i enc 
In a frequency distribution table the variable is presented with its frequency 


percentage frequency. 


Table 1.3 


Distribution of number of deaths 
that occurred per day during a 
period of one month in hospital A 








Number of Frequency 
deaths/day (percent) | 
0 5 (16.7) | 
723.23] | 
i $ (26.7) 
3 2 (3.9) 
4 2 (6.7) | 

5 1 (3.3) 





: ! 
Total 30 (100) : 


By examining the “requencies and percentage frequencies the pattern of deaths in hospital 


A could be described. 


Percentage frequencies also become important when 4 comparison has to be 


made on vecurrence of deaths between two hospitals. 


By louking at tabie 1.3 the frequency of deaths in hospital A. could be described 
as follows: 
* imeight out of 36 days ie. 24.726 of tie ime there had been uve deatitn pes 
day in hospital A. As this is the highest lrequency seen in this data set we can 
siy that, majority of the time there had been 2 deaths per dav 
© 3 deaths and | death per day had occurred with equal trequeney 
1c. 23.3% of the time 
© Sixteen point seven percent (16.7%) Of the time on 5 duvs. th h 
: dS, (here 
deaths 


us been no 





Look at your table 1,2 on the frequency distribution of heigh, | 
distribation of heights ¥ and 
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adie 1.4 gives the frequency distribution of deaths SECON th hagn) 
| | Spital R 
‘ 
eee Pt 
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s et 
Table t.4 sey 
Distribution of | 
stribuhon of number of deaths thai Occurred 
day during a period of on i a 
e month in hospital p 
: ‘ 
Number of | 
Number of Frequeney : 
~*~ * 
deaths day (percent) | 
> 
Q § (16.7) 7 
| 3 : 
9 (30,0) 
. . 
) 
3 5 (10.7) | 
s ‘ 
& | (3,3) 
etal eee ! 
ain Tt | | 
“ou 30 (100) 
SO, SRT ATTEN Set 
Compa e the trequenev af de ' 
requeney of deaths in hospital A&B. 
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Table 1.5 shows the tabulation of heights of students. This table has certain additional 
features. 


* cumulative frequeney 


* relative cumulative frequency 


By adding up the frequeney of the first Glass interval to that of the subsequent class 
interval, you get the cumulative frequency for the second class interval. This, when 
repeated gives the cumulative frequency for each class interval as given in column 3 of 
table 1.5, Similarly, the relative cumulative frequencies for class intervals are obtained 


by cumulating the relative fi equencies. 


column | column 2? column 3 column 4 


cement ie te re SEL ee ee ee re ee 


Table 1,5 
Mrequeney, cumulative frequency and relative 
cumulative frequency distributions of a group 
of medical students in your batch. 








Meights Frequency Cumulative — Relative 
(em) (percent) lrequency cumulative 
trequency 
140.0-144.9 Ol (2.9) ()| 2.9 
145.0-149.9 QO2 (5.7) 03 8.6 
150.0-154.9 09 (25.7) 12 eee 
[58.0-159,9 13 (37.1) 25 714 
160.0-164:9 05 (14.3) 30 $5.7 | 
165.0-169.9 Re 427) 32 91.4 
170.0-174.9 03 (8.6) 35 100.0 | 
Total TOT : 





Well !, [can describe the distribution of heights presented in table 1.5 as. 
¢ 37% of the students (Which is the highest frequency) have their heights 
between 155.0 and 159.9 em 
e followed by 25.7% of students whose heights are distributed between 150.0 to 
pA FRE eed ae ema and so on 
e looking at the relative cumulative frequency, 71.4% of the students have their 


heights equal to or less than 159.9 cm and so on....... 


1.6.2 DIAGRAMS 


Data presented in a table could be presented graphically or diagrammatically. When 


graphing data, it is customary to indicate the values of the variable on the horizontal or x 


‘axis and the frequency of occurrence on the vertical or y axis, 
There are many forms of diagrams. Some of them arc: 

1. Bar diagram 

2. Histogram 

3. Frequency polygon 

4. Frequency curve 

5. Pie chart 


6. Graph 


Let us now look at how we could present data in table Tt. diagrammatically. 


BAR DIAGRANI 


igure 1.6 


Distribution of number of deaths that 
occured per day in hospital A 


Frequency 





1 2 3 4 S 6 


Naof deaths per day 


* . ry ‘ , . i. . x > . 
The variation in the occurrence of deaths per day is very clearly visualised here. 


1.6.2.1 Steps in drawiny a bar diagram 


1. The length of bars represent the frequency or percentage frequertcy. Each bar’ is 
drawn proportional to the frequency or percentage frequency” ; 
2. The width of bars should be uniform 


wee . £€ 


throughout the diagram 


ae There ae(opaces tween bars. 


This is a simple bar diagram. 
Multiple and composite bar diagrams too could be drawn. 


ay, Refer to the fixed learning module (FiLM) for multiple composite bar 


diagrams. 


Look at your table on the frequency distribution of heights in table 1.2 and 


present this data diagrammatically: 


S 


my 


Hope you realise that the number of deaths per day, is a discrete variable 


and heights, is a continuous variable. A discrete variable may be presented 


by a Bar diagram while a continuous variable could be presented by a 


liistoyram. 


7" ae 


1.6.2.2 Steps in drawing 8 histogram 
|. The hor.zontal axis, X, presents the variable and the vertical mia, rte 
Ss 


or the percentage frequencies. 

> The values of the variable are presented by drawing vertical bars propor. 
frequencies or percentage frequencies. Here the vertical bars are ee _ ' 
another. .-Marking and labeling of the horizontal axis could be done oe 





class limits, class mid-points or class boundanes. 


i A convenient way to draw a histogram would be to draw it on class... 
As the class boundaries overlap automatically the rectangles oie oe 
~ SACen 


ore another. 
Figure 1.7 resents the histogram of the frequency distnbution of heights given :. 
table 1.5. 


Figure 1.7 


Distribution of heights of a group of medical students 
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145 0149.9 18001849 1850 1899 160.0.164.9 


Heights in cm 


140 0.1449 


Figure 1.9 





1.6.2.3 If you join the mid points of the upper horzontal lines of each of the rectangles 


in the histogram by a straight linc you would get a frequency polygon as shown in figure 


| 8. 
In a large data set when you have many class intervals with narrow width the frequency 


polygon takes the form a smooth curve as in figure |.9. This is called a frequency curve. 


a Frequency distribution of acontinuous variable is diagrammatically present 
a ST EE NE EL PII IS LIE IE ELIE DEI TA EO GOO TS IES 


by a histogram, while a discrete variable is presented by a bar diagram. 
enemas elses 


We have looked at some forms of diagrammatic data presentations. There are man 


more. 


1.6.2.4 Pie diagram or pie chart is another form of pictoral presentation of data. 


dl Like cutting up a pic into picces, in a pie diagram the frequencies or the 


percentage frequencies of a variable is presented by dividing 360° of a circle. 


proportionately. 


49. . 
. ¥ - we 





Wirrrew. 
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ie? Refer FiLM, for an example of a pie-diagrams. 


1.6.2.5 Graph 

A graph is a diagrammatic method of showing quantitative data using a co-ordinate 
system, x and y. Generally, thc variable assigned to the x axis is the independent 
variable and the variable assigned to y axis is the dependent variable. The graph shows 


the change in the value of variable in y axis with respect to the change in the variable in 


the x axis. 


er" Certain basic rules should be followed when drawing a graph to avoid creating 


a misleading visual impressions. 


Following are some important points to observe when drawing a graph. 
|. [tis important to draw a graph to scale, and to start the vertical axis from zero. 


2. It is traditional to draw the vertical axis 75% in length of the horizontal axis. 
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Like for all other types of presentations, a graph should ve, 
e simple 
e sclf-explanatory 


e and have atitle 


ee g Refer FiLM for graphs. 





What do you think are the advantages and disadvantages of tables vs diagrams a5 method 


of data presentation? 
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Both tables and diagrams have their own advantages and disudvantages as methods of 


presenting data. 


* ‘Tables are more scientific as they are precise. 


* Diagrams creates a better visual unpact. 








I - DESCRIPTIVE STATISTICS 


LESSON 2 


Numerical methods of describing data. 
LEARNING OBJECTIVES 


Atthe end of the lesson the student should be able to: 


|. Explain the following summary measures and their application 
in medical science. 
bid measures of central tendency ( averages ) and measures of 
location. 


AY. measures of dispersion. 


tJ 


Calculate the above measures tor a given sct of data. 

3. Select an appropriate measure of central tendency to describe 
a given set of data. 

4. Explain the use of centiles. 


5 Comment on the uses and limitations of such measures and 








their advantages and disadvantages. i" 


I SS 


SS 


| 
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2. Numerical measures 
Biological measurements are subjected to variability. To describe a data set appropriate 


Sumunary measures are adopted. These summary measures are of two types 


Those that measure: 
I. the central point in a data set 


OD LLIN ES 
a ee ed 


et and are referred to as measures of of central tendency 
and those that measure any other point in a data set are referred to as measures 


EP 8B ak tS TREATED 


of location 
variability within the data set and are referred to as measures of f variability, — 





variation or dispersion 
I. Dee TEE A ee RR Ae Oe 


2.1 Measures of central tendency 
These measures indicate the central point around which all valucs of a variable are 
arranged. Hence, this is a single value representative of all values in the data set. There 


are three measurcs of central tendency. 


They are: 
2.1.1 Mean 
Se 


2.1.2 Median 


2.1.3 Mode 





2.1.1 Mean 
There are different types of means of which the arithmetic mean is the most popular and 


widely used. There are two types of arithmetic means; 
simple arithmetic mean 


weighted arithmetic mean 


i. 
li, 
Simple arithmetic mean (x) 


i. 
This is the sum of all valucs of a variable divided by total number of values observations. 


= lL formula 2.| 


~- 
- oman a 6 ee eS ae ce 





ee 
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x1 to x, are the values taken by the variable x. Hence, x; refers to all values of the variab 
X. Sigma (2°) is the operational term which indicates adding up of all values of x. The 


total number of values for variable x ign (number of observations). 








Marks scored at a continuous assessment cxanination by a group of 10 randomly 


selected students arc:46 S++ 76 38 See a 68 S52 00 


fo 4 


——O Fy. 
———— nt 


ber of drugs given to patients attending an OPD is given in table 2 | 
Number 


Table 2.1 
Distribution of number of drugs 


given per patient in an OPD ° 








Number of drugs Frequency 
(x) (f) 
0 8 
| | 42 
2 8 
2 
4 0 
eee 
Total 60 


Using the above sata calculate the avcrage number of drugs given to a patient 
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P Weighted arithmetic mean 






figure 2.1 shows three groups of children. The number and the mean weight‘of children 


: 


b each group ts given, 


Figure 2,1 





. Can we say that the mean weight of all the children are: 37.2 + 34.04 41.3 
: 


> 
= 37.5kg? 


spect of the three groups of children. 


| No, here the weights given arc in re 
en and, all children in the 3 groups can be 


_ Total weight for each group of childr 
; calculated as, 


Total weight in group A - 37.2x 120 = 4464 


-- 34.0x82 = 2788 
= 4[.3x 126 = 5204 

4464 + 2788 = 5204 = 12456 

129 - B2 + 126 = 328 


Total weight in group B 
Total weight in group C 


Total weight oF all children 


Number of children in groups A, B, and C = 


| , Communit y Sircam, F/M, Colomba 


If we divide the total weight by thc total number of children, we get the mean weight for 
the whole group as 37.98 kg. | 
This can also be represented as, 


| _ wx 
NPWS Pons ccecieds WX, i ‘" 
7 = __ la 23 sag 
Wee eae Jormula 2. 
1 7 7%) | eeocrscrmucece k Ww, . 
iz 


A set of n observations consist of k different groups where “ ands w; are the average and 


frequency for each group (i =1,2,.....k). In calculating the mean for the entire group, | 


average for each group | Is multiplied (weighted) by the number of observations 
(frequency) in each group. ot 


How is the mean calculated on ‘data given in ite form of a grouped frequency 


oo distribution? Let us look at the example given below. 





‘ 


_ The birth ere of 470 infants born i in » hospital A = one year is given in table 2.2 


Find the mean birth’ weight of the infants. 





Table 2. Me 
Distribution of infants 
by their birth weights 

















Birth weight No. of 
ofinfants — infants 
(kg) (f 
’ 20eoe | 15 









25-29 — 6 
3.0-3.4 190 
3.5 -3.9 - 

40-44 28 
45+ 6 


Total 



























eB] 


oo, 





‘Si \ 


When data are grouped into class intervals the individual weights of 470 infants or the 
raw data are not available. We assume that the average birth weight for each class is the 
mid-point of that class interval. The class mid point multiplied by the frequency of 
observations in that class will give a value which would approximate the sam of the 


Original birth weights of infants falling into that interval. - . = 


2.1.la Steps in calculating the mean fon grouped data 








l. Find the class mid-point for cach class interval. The class mid-point is considered 
as the average value (x), for that class interval. | 
2. Multiply the class mid-point by its frequency (0. This would give ‘the estimated 
Set for the group. Complete the columns ssa in ve table 2. 3. 
column 1 column 2 xk column 4 | 
yf 
Table 2.3 ; 
Distribution of infants by their birth weight : 
Birth weight No.of - Class mid _ 
of infants infants = point . |. 
(kg) () (Cy 
2.0. +=2.4 a oe ae 
2.5 -2.9 46 ~~ 22st 2.572 
= 3.0 -3.4 190 eee 
3.5°°3.9 135 ila 2 are a a 
4.0 -4.4 28 i ae 
4.5 + 6 Wee Be 
cae eee 
Total 470 Sie alee ze 
- 4 
3. Sum up column 4 (Sfx). 
4. Divide X{x by the total number of observations (n). This gives the mean weight of 


infants born in hospital A. 


mince Ce bal 








3 This mothod is a short cut method to calculate the mean when the dats set, 
large. This method has been used to calculate the mean before Calculators Can, 


into uss, 


Now you know how to calculate the mean when data are given in a series of Values ang 
when summarised into a frequency distribution (groupcd and ungrouped). 


2.1.2. Median 
The median is the value which occupies the middle position when al! Observations are 


arranged i inan:uwray, either ascending or descending. 


(err) 


Erythrocyte seclimentation rates in (mm) of 7 subjects are given below: 
7, 5, 3, 4, 6, 4, 5 : 


2 | ee ot bk irveetterenietvnncietrceraies.. 
OR AIF rrinrorsiectsetsrincompusmmmmn cnn enti 
What is the median?............... 


2.1.2a Steps in calculating the median 


[. Arrange the values in an array. 


2. Number th» observations in seria] order as follows. 


f 
3 5nd 4" 4th fh oth ath 


5 404 5 5 6 9 


As there are Only 7 observations. the middle observation can easily be located: 
ed to 






“oweve:, when the n+ 1)/2 can be us 


data set iy large, the expression, 


fiy 
and the the Che serial {humber of the Median value. 


3, ( 
Use the formula LO find the Median 
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ik ores beatin pV 











Erythrocyte sedimentation rates o 8 subjects are given below: 


te 


Soe eeceerwrtscetesevneseeewevrusecevssa« 0 cae Gm 


two middle numbers. © 4 


The equation when applied to this data gives the following 8+1/2 = 4.5. This indicates 
that the median igslda cater -beiween the 4th and Sth observations. The median is thus the 
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Median = pa @Z=DC 7 F)C formula 2.4 


where, ‘L, is the lower class boundary of the median class 
43 


a g::, the ‘otal number of observations 





cumulative frequency of the class above the median class 


i, Trequency of the madian class 








C, width of the median class 


rhs tus look at haw we could apply formula 2.4 to find the median of the data on birth 


«Weights of infants given in table 2.2. 


3 
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2.1:2b Steps in calculating the median from grouped data 
l. Identify the class interval to which the median belongs. 
In order to find the median class, use the expressior f(m+)/2) 471/2 = 235.5 ), i.e. both | 
235" and 236" observations are middle numbers. Next is to see to which class these two 
numbers belong. In order to find that it is necessary to find the cumulative frequency. 


to which the median belongs. 


column | column 2 column 3 












Table 2.4 
Distribution of infants by their birth weight 





Weight of No. of Cumulative | 
infants _ infants frequency 
ke ‘(F) | 
* 20-24 15 ee | 
25-29 ~ 9 eseesatcld: 
3.0- 3.4 190 ea! 
35-39 3s a 3 
$0 4.4. | a — | 
‘a. 6 ee. 5 FS | 





Mode is the mostcammonly occurring value in the data set or it is the value tha! occurs 
with the greatest freauiency. Ina frequency curve mode refers to the highest point. 


a3 


(13) 
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‘able 2 2.5 shows the distribution of drugs given per patient in an Out Putient Depansnent 


et What is the mode in the following distribution 


i: 
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Table 2.5 


Distribution of number of drugs 


given per patient in an OPD | 





Number of drugs Frequency 
(x) (f) 
0) 8 _ 
42 
2 8 
3 2 
ct 0 


We have already learnt that an average is a single value representing a group of 
values. Cora value to be a "good" average it should sausty certain criteria. 


An average should: 


* be easily understood 

*~ be simple to compute 

* — be based on all values 

* not be unduly affected by extreme values 
be defined by a mathematical formula 


* be capable of Further algebraic treatment 


4 
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Taking each of the above properties into consideration, do you think all threc averages” 


are equally “good” in representing the data? Let us try to understand this by looking at 


some examples. | 





Haemoglobin levels (g/dl) of 26 normal children are as follows: 





11.8 AOR +. -: 108°, 12.4° - 11.6 13.2. 13.8 - 
11.4 ». A22>%* 25°: .- 117 12.6 142 — 22 
10.4 12.9 10.5: 27 132 13.5 «1G 
23, . WW2 -* 12.2 12.9 13.0 


If you Calculate the mean, median and mode for this data, you will realise that all three 


averages have the same value 12.2 g/dl. Hence, any one of them could be used as a 


value representing the data. However, referring to the properties of a“ good” average, 
mean is easily understood and simple to compute. Neither arraying of data as required 
for calculating the median nor grouping of data as required for locating the mode are 
necessary to calculate the mean. Mean is based on all values in the data sct including 
extreme values, whereas both the median and the mode refer to one value in the data 


Set. Being defined by a mathematical formula it can be subjected to further algebraic — 


treatment when compared with the median and the mode. However, the mean has a 


limitation when used in representing data. 


What do you think is the limitation ? . 





A health planner wants to open a new unit in hospital B and needs to know the average 
duration of stay by patients in this hospital in order to calculate the optimum number of 
beds necessary. He has studied the duration of stay of patients admittcd during the past 


ene month to this hospital and. the data obtained is given in table 2.6 


Table 2.6 
3 4: § 


Duration (indays) [ 2 
No. of patients 20 77 30 21 15 





1) Calculate the different averages, 
2) Choose the value that you would use it you were the health planner. 

3) Give reasons for your choice, =o 
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Exercise 2.9 


Monthly income of 11 employees in diftcrent pecupations working in a private sector 


Organization is as follows: 


Rs.12,000/= Rs.9,000/= Rs, 2,000/= Rs.3,000/= 
Rs. 10,000/= Rs.6,000/= Rs.25,000/= Rs.9,500/= 
Rs. 2,000/:= Rs.3,000/= Rs. 3,000/# 


The mean, median and mode when calculated works out to: 
mean = Rs, 7,681.82, median = Rs,6,000/=, mode Rs.3,000/=. Which of the above threc 


measures do you think is the best to describe the monthly income of employces? Give 


reasons for your choice. 
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2.2 Measures of location | 
Averages or measures of central tendency locate the center (mid point) of the distribu, 


of a variable, (t would provide more information if the data is also described tiv, 

| values 0 of the variable at other points. In order to locate these points it is necessary 
arrange the values in an ascending order of magnitude or draw a frequency curve. Ret 
page 19). | 
Commonly used measures of location are : | = : | | : 

1 _Quartiles | | 


ii Deciles 


ae 





ili Centiles or Percentiles 


i. Quartiles | | | Figure 2.2 
Quartiles divide a distribution into four equal 


parts. There ar: three quartiles. 









ii. Deciles | Figure 2.3 
‘Deciles divide a distribution into 10 equal parts. 


There are nine deciles. 





Qr Qr Q1Q1QsunQ-Q: Qy 





iii. Centiles or percentiles | 
Centiles or pe:centiles divide the distribution into 100 equal parts. The median is the 
50th percentil:. Percentiles are good measures of locations. They are used to describe 


the growth in children. 
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3 Measures of Dispersion 


Fccribe the variability observed. 
{ Example 2.3 

FHaemaglobin levels of two groups of children are: 

I. Group I: 12.1, 12.2, 12.8, 12.9, 12.3, 12.4, 12.7; 12,6, 12.5 
Group Il: 12.1, 12.3, 11.7, 11.9, ial, 13.3, 12.5. 








12.9, 12.7 


The range and avcrage for the above data is as follows: 


i . Group | Group I 

a No. of observations 9 9 
Average | 12.5: 125 
Range ~ | 0.8 1.6 


4 his tells you that the average alone is not good enough to describe data. The dispersion 


or scatter of the data too, has to be looked at when describing data. 


q ; 23 Range | 
4 232 Inter-quartile range 
F 23.3 Mean deviation 
234 Standard deviation 
2.3.5 variance | 
: 2.3.6  ~ Co-efficient of variation 
12.3.1 Range 





7 You already know what range ts. Itis the difference between the largest and the smallest 


- Values in the data sct. Look at the haemoglobin levels of children in Group | and Group 
Hin “xample 2.3. The range could be given as follows: 
Group | - 12.1 to 12.9 and / or 0.8 


Group Il - 11.7 to 13.3 and/or 1.6 








4 
‘ 


me” Range is the simplest measure of dispersion. It only tells the minimum and 
ce ee 


maximum values and that all other values are placed between them. 





Figure 2.4 


2.3.2 Inter-quartile range 
You already know what a quartile is. The 
inter-quartilerange is the range within 


which the middle 50%: of thevalues lie. 





2.3.3 Mcan deviationThe mean deviation is a better measure than the above two. It is 


the arithmetic average of the deviation of observations from the arithmetic mean. The 


formula to calculate the mean deviation could be given as: he 
aguas ieee L(x-X 
Mean deviation =~ _E(r=3) 


Jormula 5 
on | 


re 


As observations are distributed both above and bclow the. mean, the differences 
of each observation from the mean may be either a positive Or a negative value, 


the sum of these differences equaling zero. In calculating the mean deviation, all 
ee 


| 
ifferences arc considered as positive. _ | 


Steps in calculating mean deviation ; 


1. Calculate the mean for the data 


2. Find the difference of each observation from the mean (ignore the sign, take all 


differences as plus) 
J. Add the differences, 


- 4. Divide by the number of observations 


s 


Though the mean deviation is a belter measure 


of dispersion than the range or 
the inter-quartile baile it is ‘not uscd in statistival analysis in drawing 
inferences. 


3R | ( IN j 


Example 2.4 


The Erythrocyte Sedimentation Rate (ESR in mm ) of 8 normal individuals are: 
4, 4, 5, 4, 3, 4, 5, 3Find the mean deviation.!. Enter the data into column | as 
shown in table 2.7 


column 1 column 2 


Table 2.7 
Erythrocyte sedimentation rate and the 


deviation of each observation from the mean 





ESR | Deviation from 





(x) mean (x-x) 
4 | 
4 
5 
4 0 
3 
= oe 4 0 
5 1 
3 


Total 32 | 4 


2. I-ind the mean. 


yet Bl, 
n 8 
3. Subtract the mean from each valuc as shown in column 2. 
4. Add all differences. 


> (x-X)= 4 


a7 


Ny 


5. Divide the answer you observed in step 4 by the total number of observar; ons 


X(x-x) _ = = 0.05 


a, 
mean deviation = 0.05 


2.3.4 Variance and 


2.3.5 Standard deviation (SD). 
Variance and standard deviation are better measures than the mean deviation and x 


most conimonly in statistical analysis. Formula to calculate the variance and st 


deviation could be given as: 






Jormula 2.6 





Varuee = 


ee 


| . = 
(s andard deviation mea 
a | nal « 


Steps in calculating the standard deviation and variance 







formula 2.7. 


1; Enter the observations vertically downwards in a column. 


2. Find the mean. | 
Find the difference of each observation from the mean and enter alongside ' 


each observation in a second column. 


= when calculating the standard deviation and variance the sign of the devii 





has to be taken into account. 





4.  Squuure the differences of cach observation from the mean and enter in a- 
column alongside with the differences. 

5. Add the squared valucs to get the sum of squares. 

6. Di-ide this sum by the number of observations minus one lo vet the mean sq 
deviation. This is called the variance. 


7. Fird the square root of this variance to get the standard deviation. 


oe RETy Sa ee ROE CBN oY Tee tC nn Te 


40 








Variance and Standard deviation can be expressed by another formula: 


Variance = eo | 
| — Sormula2g 


Standard deviation <= —_—_——___ formulaz9 


A large standard deviation indicates that the values af the variable are widely 


spread out from the mean, A small standard deviation means that the values are 


scattered closely around the mean. 


Exercise 2 2.10 





Respiratory rate in 9 patients are: 
23, 22, 20. 24, 16, 17. 18, 19, 2 
Find the standard deviation and variance of the respiratory rates. Use table 2.8 


= 





Table 2.8 
Respiratory rate, deviation of each value from the mean 
and the squared deviation 


Respiratory rate Deviation Squared deviation 
(res. / min. ) from means (x -x)" 
(x -X) | 
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— Find the 
intelligence quotient (IQ) of 5) boys is given below 
variance. Use both formulae. 
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— = When data is given in the form of a frequency distribution, rem 
value taken by the variable in each class interval (mid-point) hae , 
multiplied by the frequency. Each deviation too has to be Multipl; 

frequency. 


? oo 
§ 


: Variance = Ata) formula 2. 


Standard deviation = f(x — x) 


Sormula 2) 
n-l 





Ly? f= (Ltxy’ 
n 


V ariance — —— 
n- TI 


22° (-(2 iy 
n 





S tandard deviation 





| 

turmuia 2 1% 

n- |] 
; 

; 

| 


8.3.6 Coefficient of variation (CV) 


It ist measure used to compare relative variability. [tC may be the variation y | 


characteristic in'two or more different series, ¢.g. variation of pulse rate among 
FR, eset = or i 


aul «td. It can also be used to measure two different characteristies in the s: 





Co-efficient of variauion is a measure that compares variability 





irrespective of the unit of measurement, e.g. pulse rate and : 





blood pressure (mm af Hg). 


CS a 2D 
. xX [00 (“" 








DEFINITIONS 





"4 
hays interval - an interval of values into which a range of values of a variable has been 
a divided 


Fass limit = the end numbers of a class interval. A class interval has two class limits: 
‘ Lower class limit and upper class limit | 


true class limits of a class interval 


¥ 
a3 
f P 
{ 

x 

; 


the distance from the lower class boundary to the upper class boundary in 


a class interval 









| Percentage frequency — frequency of a class when presented as a percentage of the total 
t nme 


4 Independent and dependent variables — When an association mene two variables is studied, 
; the variables may be referred to as the independent and dependent variables. The vaqjable that ig 
3 u scribe or measure the lem_under is called the dependent variable. The 


' variables that are used to describe or measure the factors that are assumed to cause or influence 
the problem are called the independent variable. In a study of the relationship between smoking 


and lung cancer, “suffering from lung cancer” would be the dependent variable and “smoking” 
i the independent vanable. | | 
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ANSWERS TO EXERCISES 


2.1. 45.6 
2.2. 1.07 drugs 
243. J2agkg 
2.4, Smm 
2.5. Smm 
2.6. 2.95 +(470/2-111) x .5 = 3.28kg — 
190 
2.7. 1 drug oe ~ 
2.8. i) mean= 3.57 days, median = 3 days, mode = 2 days 


ii) 2 days (mode) 
2.10. variance ™ 7.5, standard deviation = 2.74 
2.11 variance * 1394.45, standard deviation = 37.34 





Cem ee LES 


bd 


CV is expressed as a percentage and is given by: 





CV = SD X 100 


Mean 









formula 2.14 






Example 2.5 


In a group of boys the mean systolic blood pressure in mm: Hg. was 120 and SD was 10. 
In the same group of boys the man hcight and SD was 160 cm and 5 cm, respectively. 
Which characteristic shows greater variation? | 


CV of blood pressure = 10 x.100 = 8.3% 
C V of heights = 5 x 1M = 3.1% 
160 : 


Blood pressure is aia to SnOW 2 greater variability. i.e. 2.7-times that of the height * 


(8.3/3.1). 
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Proof of the formula for the binomial variance 


Since the summation j IS OVer les ns 


2 by the Sample proportion D. This thes as 


Annet 3 é a 


Suppose a population of N individuals is classifi . as “success* or + failure; ar 
“i individual define x=1 for a ‘success’ and 0 for a failure. The mean valy 
ty, ta Parc, tite saunas proportion of SUCCESS, 7. i 


Thus pee x= tem _—s cs successes = 
N 


And the population variance of x IS 2 (X - uy? &N 


be x)N Ph a - (2x Pane Fotticd 
a N N°. > | | 


Suppose a sample of N individuals i is taken from the popuiation N individuals. 


nu 
mber of' successes out of n say r is simply the sum of the n values of x. 


So. var(e) = var(Ex) = 5 var (x) = 5 E (1: 7) 


aninle ite 


Var(r) = nz (\- “ 


ained Cy replacing the ponte ation pro} 
Stimated variance of ¢ is rt lp) with 


moenetiines wrilt 
en 
P IS the proportion of failure. 


aS Npq where q= I 
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_ANSWERES TO EXERCISES 


14/52 2, 1/6 
1. 80/200 —_—-2,, 120/200 
54 Students 


32 Soldiers. 


1 15.86% 2. 0.13% 


127.72 and 129.88 


Boys, 80.95 and 101.45 | . 
Girls, 832.16 and 99.44 


0.46, 95% CI. = 0.32 to 0.59 


99% CI. = 0.28 to 0.64 
No, SND=10,  P<0.01” 
No, SND=238, P<0.05_ 
Yes, SND=1, — P>0.05 
042 2. Yes 3. 0.801 
Yes, SND=6.0, P<0.01 
Yes, SND=7.59, P<0.01 


Yes, SND=2,  P<0.05 
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Annex I 
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0.0 
0.1 


0.2 ° 


0.3 
0.4. 
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0.9 


1.0 
1.1 
1.2 
1.3 
I.d 


2.0° 
2.1 
co 
2.3 
.4 


BUNmM OS 


© CO 1 AW 


0.0 


0.07: 


0.0° 


50000 
46017 
42074 
38209 
34458 


30854 
27425 
24196 
21186 
18406 


15866 
13567 
11507 
96800 


80757 


66807 


54799 . 


44565 
35930 
28717 


22750 | 


17364 
13903 


10724 


81975 


62097 
46612 
34670 
25551 
18658 - 


13499 
96760 
68714 
48342 
33693 


23263 
13911 
10780 
72348 
$8096 





49601 


45620 
41683 
37828 
34090 


30503 
27093 
23335 
20897 


Std 


ans 


15350 
bist 
95093 
79270 


65522 
53699 
43633 
35148 


28067 


22216 
17429 
13553 
10444 


79763 


60366 
45271 
33642 


24771 


18071 


13062 
93544 
66367 
46648 


32481. 


22405 
15310 
10363 
6943) 
46148 


49202 
45224 
41294 
37448 
33724 


30153 
26763 
23576 
20611 


(7879 


15386 
13136 
11123 


«93418 


77804 


— 64255 


52616 
42716 


34380 


27429 


21692 - 
17003 
13209 
10170 


77603 


58677 
- 43965 
32641 


"24012 
‘ $7502 


"12639 
90426 


64095 


45009 . 


SISN 


- ZUAI7 


147350 
"99611 
66726 
44274 


3 


48303: 


44523 
40905 
37070 
33360 


29806 
26435 
23270 
20327 
17619 


15151 


12923 . 


10955 
91759 


- 76359 


63008 


— SSSI 


41815 
35625 


. 26803 


21173 


16586 
12374 
*99031 
75494. 


$7031 
42692 
31667 
23274 


~ $6948 


12228 
87403 
61895 
43423 
30179 


20778 
1417] 
95740 
6-1072 
4747) 


te. 


a 


4 


48405 


4445) 
40517 
36695 
32997 


29460 


26109 


22965 
20045 
17361 


14917 
127)4 
10749 
90123 


74934 


61780 
$0503 
40930 
32884 
26190 


20675 
16177 
12545 


96419 © 


73436 


35426 
41455 
30720 
22557 
16411 


11829 
84474 
59765 
41349 
29086 


20006 
(3632 
92010 
61517 
4074) 


43006 
44038 
40129 
36317 


32636 


29116 
25785 
22663 
19766 
17106 


14686 
12507 
10565 
88508 
73529 


60571 
4947] 
40059 
J2157 


25583 


20182 
15778 
12234 
93367 
71428 


5386! 
40246 
29798 
21360 
15839 


* J1442 


81635 
57703 
40406 
28029 


19262 
[Jit2 
83417 
SY0SY 
ST6 


The Standard Normal Distribution 
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